Hybrid Learning Framework for Large-Scale Web Image Annotation and Localization

نویسندگان

  • Yong Li
  • Jing Liu
  • Yuhang Wang
  • Bingyuan Liu
  • Jun Fu
  • Yunze Gao
  • Hui Wu
  • Hang Song
  • Peng Ying
  • Hanqing Lu
چکیده

In this paper, we describe the details of our participation in the ImageCLEF 2015 Scalable Image Annotation task. The task is to annotate and localize different concepts depicted in images. We propose a hybrid learning framework to solve the scalable annotation task, in which the supervised methods given limited annotated images and the searchbased solutions on the whole dataset are explored jointly. We adopt a two-stage solution to first annotate images with possible concepts and then localize the concepts in the images. For the first stage, we adopt the classification model to get the class-predictions of each image. To overcome the overfitting problem of the trained classifier with limited labelled data, we use a search-based approach to annotate an image by mining the textual information of its similar neighbors, which are similar on both visual appearance and semantics. We combine the results of classification and the search-based solution to obtain the annotations of each image. For the second stage, we train a concept localization model based on the architecture of Fast R-CNN, and output the top-k predicted regions for each concept obtained in the first stage. Meanwhile, localization by search is adopted, which works well for the concepts without obvious objects. The final result is achieved by combing the two kinds of localization results. The submitted runs of our team achieved the second place among the different teams. This shows the outperformance of the proposed hybrid two-stage learning framework for the scalable annotation task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Heuristic Label Set Relevance Learning for Image Annotation

Automatic annotation can automatically annotate images with semantic labels to significantly facilitate image retrieval and organization. Traditional web image annotation methods often estimate specific label relevance to image, and neglect the relevance of the assigned label set as a whole. In this paper, A novel image annotation method by heuristic relevance learning is proposed. Label releva...

متن کامل

Automatic Web Image Annotation via Web-Scale Image Semantic Space Learning

The correlation between keywords has been exploited to improve Automatic Image Annotation(AIA). Differing from the traditional lexicon or training data based keyword correlation estimation, we propose using Web-scale image semantic space learning to explore the keyword correlation for automatic Web image annotation. Specifically, we use the Social Media Web site: Flickr as Web scale image seman...

متن کامل

Web Image Search by Automatic Image Annotation and Translation

There has been a growing interest in implementing online Web image search engine in the semantic level. However, most existing practical systems including popular commercial Web image search engines like Google and Yahoo! are either textbased or a simple hybrid of texts and visual features. This paper proposes a novel Web Image Search by Automatic Image Annotation and Translation (WISAIAT) syst...

متن کامل

A Scalable Bootstrapping Framework for Auto-Annotation of Large Image Collections

Image annotation aims to assign semantic concepts to images based on their visual contents. It has received much attention recently as huge dynamic collections of images/videos become available on the Web. Most recent approaches employ supervised learning techniques, which have the limitation that a large set of labeled training samples is required for effective learning. This is both tedious a...

متن کامل

Self-Transfer Learning for Fully Weakly Supervised Object Localization

Recent advances of deep learning have achieved remarkable performances in various challenging computer vision tasks. Especially in object localization, deep convolutional neural networks outperform traditional approaches based on extraction of data/task-driven features instead of handcrafted features. Although location information of regionof-interests (ROIs) gives good prior for object localiz...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015